1 Capital Cities (Urban Mobility)

1.1 Sample Populations

This section covers the descriptive statistics of the four sample populations.

1.1.1 Dhaka 2009

The data were captured on October 1-2, 2009 with a sample size of 42,111 people. The personal data occupies 3.5 Mb of memory.

Data set description:

describe(dhk09.person)
## dhk09.person 
## 
##  6  Variables      42111  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##   42111       0   42111 
## 
## lowest : 1     2     3     4     5    
## highest: 47486 47489 47491 47492 47493 
## ---------------------------------------------------------------------------
## sex 
##       n missing  unique 
##   42111       0       2 
## 
## Male (22541, 54%), Female (19570, 46%) 
## ---------------------------------------------------------------------------
## age 
##       n missing  unique 
##   42111       0      18 
## 
##           [0, 5[ [5, 10[ [10, 15[ [15, 20[ [20, 25[ [25, 30[ [30, 35[
## Frequency    319    3555     4567     4236     4949     5161     4133
## %              1       8       11       10       12       12       10
##           [35, 40[ [40, 45[ [45, 50[ [50, 55[ [55, 60[ [60, 65[ [65, 70[
## Frequency     4031     3138     2693     2123     1363      972      476
## %               10        7        6        5        3        2        1
##           [70, 75[ [75, 80[ [80, 85[ [85, Inf[
## Frequency      269       97       21         8
## %                1        0        0         0
## ---------------------------------------------------------------------------
## occup 
##       n missing  unique 
##   42111       0       9 
## 
## Unknown (865, 2%), Government service (1925, 5%) 
## Private service (6901, 16%) 
## Business (6468, 15%), Unemployed (841, 2%) 
## Student (13861, 33%), Housewife (10571, 25%) 
## Agriculture, farming (31, 0%) 
## Others (specify) (648, 2%) 
## ---------------------------------------------------------------------------
## occupc 
##       n missing  unique 
##   42111       0       6 
## 
## Blue-collar (31, 0%), White-collar (15294, 36%) 
## Student (13861, 33%) 
## Housewife, househusband (10571, 25%) 
## Unemployed, retired (841, 2%) 
## Other, unknown (1513, 4%) 
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##   42111       0   41963   11584    1250    1893    3713    7839   15293 
##     .90     .95 
##   25342   33859 
## 
## lowest :      0.00     14.97     30.46     55.55     61.39
## highest: 126406.08 126643.55 135625.47 152718.42 182414.31 
## ---------------------------------------------------------------------------

plot of chunk plotHistDhk09Person

1.1.2 Hanoi 2004

The data were captured on October 1-2, 2004 with a sample size of 34,846 people. The personal data occupies 2.8 Mb of memory.

Data set description:

describe(hni04.person)
## hni04.person 
## 
##  6  Variables      34846  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##   34846       0   34846 
## 
## lowest : 2     3     4     5     6    
## highest: 47486 47489 47491 47492 47493 
## ---------------------------------------------------------------------------
## sex 
##       n missing  unique 
##   34846       0       2 
## 
## Male (17343, 50%), Female (17503, 50%) 
## ---------------------------------------------------------------------------
## age 
##       n missing  unique 
##   34846       0      18 
## 
##           [0, 5[ [5, 10[ [10, 15[ [15, 20[ [20, 25[ [25, 30[ [30, 35[
## Frequency      1    1033     2265     2594     3521     3507     2957
## %              0       3        7        7       10       10        8
##           [35, 40[ [40, 45[ [45, 50[ [50, 55[ [55, 60[ [60, 65[ [65, 70[
## Frequency     1943     2285     2892     2546     2340     2122     1926
## %                6        7        8        7        7        6        6
##           [70, 75[ [75, 80[ [80, 85[ [85, Inf[
## Frequency     1360      803      433       318
## %                4        2        1         1
## ---------------------------------------------------------------------------
## occup 
##       n missing  unique 
##   34846       0      18 
## 
## [Undocumented_1] (7, 0%) 
## Leader of branches, units (553, 2%) 
## Professional (1417, 4%) 
## Technical and associate professional (1493, 4%) 
## Clerical worker (5250, 15%) 
## Service worker, shop, market sales (578, 2%) 
## Skilled agriculture, forestry, fishery (1802, 5%) 
## Craft and related trades (1372, 4%) 
## Plant, machine operator, assembler (569, 2%) 
## Elementary occupation (1387, 4%) 
## Elementary student (5391, 15%) 
## High school & university student (1896, 5%) 
## Housewife (1012, 3%), Jobless, retired (9642, 28%) 
## [Undocumented_2] (2253, 6%) 
## [Undocumented_3] (80, 0%) 
## [Undocumented_4] (60, 0%), Unknown (84, 0%) 
## ---------------------------------------------------------------------------
## occupc 
##       n missing  unique 
##   34846       0       6 
## 
## Blue-collar (5130, 15%), White-collar (9291, 27%) 
## Student (7287, 21%) 
## Housewife, househusband (1012, 3%) 
## Unemployed, retired (9642, 28%) 
## Other, unknown (2484, 7%) 
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##   34846       0   34754   11522    1289    1938    3774    7880   15211 
##     .90     .95 
##   24994   33378 
## 
## lowest :      0.00     14.97     30.46     55.55     61.39
## highest: 126406.08 126643.55 135625.47 152718.42 182414.31 
## ---------------------------------------------------------------------------

plot of chunk plotHistHni04Person

1.1.3 Jakarta 2002

The data were captured on October 1-2, 2002 with a sample size of 36,713 people. The personal data occupies 2.9 Mb of memory.

Data set description:

describe(jkt02.person)
## jkt02.person 
## 
##  6  Variables      36713  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##   36713       0   36713 
## 
## lowest : 1     2     3     4     6    
## highest: 47486 47489 47491 47492 47493 
## ---------------------------------------------------------------------------
## sex 
##       n missing  unique 
##   36713       0       2 
## 
## Male (19196, 52%), Female (17517, 48%) 
## ---------------------------------------------------------------------------
## age 
##       n missing  unique 
##   36713       0      17 
## 
##           [5, 10[ [10, 15[ [15, 20[ [20, 25[ [25, 30[ [30, 35[ [35, 40[
## Frequency    2964     3489     4190     4502     5135     3671     3400
## %               8       10       11       12       14       10        9
##           [40, 45[ [45, 50[ [50, 55[ [55, 60[ [60, 65[ [65, 70[ [70, 75[
## Frequency     2722     2431     1769      732      783      532      243
## %                7        7        5        2        2        1        1
##           [75, 80[ [80, 85[ [85, Inf[
## Frequency       45       58        47
## %                0        0         0
## ---------------------------------------------------------------------------
## occup 
##       n missing  unique 
##   36713       0      18 
## 
## Unknown (20723, 56%) 
## Professor, manager, director, etc. (940, 3%) 
## Industry owner, retail owner, etc. (1110, 3%) 
## Engineer, doctor, accountant, pilot, etc. (1340, 4%) 
## Teacher, lecturer (940, 3%) 
## Administration staff (2679, 7%) 
## Technician (909, 2%), Waitress (1778, 5%) 
## Vedor, salesman, etc. (1594, 4%) 
## Handyman (carpenter, jewelry) (1206, 3%) 
## Construction, station, port, warehouse (441, 1%) 
## Laborer (190, 1%), Public transport driver (186, 1%) 
## Private driver (751, 2%) 
## Housekeeper, office boy, gardener, etc. (559, 2%) 
## Farmer, fisherman, etc. (5, 0%) 
## Security guard (666, 2%), Others (696, 2%) 
## ---------------------------------------------------------------------------
## occupc 
##       n missing  unique 
##   36713       0       4 
## 
## Blue-collar (4913, 13%), White-collar (9441, 26%) 
## Student (940, 3%), Other, unknown (21419, 58%) 
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##   36713       0   36604   11592    1262    1904    3717    7841   15313 
##     .90     .95 
##   25394   34016 
## 
## lowest :      0.00     14.97     30.46     55.55     61.39
## highest: 124771.38 126406.08 126643.55 135625.47 182414.31 
## ---------------------------------------------------------------------------

plot of chunk plotHistJkt02Person

1.1.4 Metro Manila 1996

## [1] TRUE
## [1] TRUE
## [1] TRUE
## [1] TRUE

The data were captured on October 1-2, 1996 with a sample size of 189,335 people. The personal data occupies 15.2 Mb of memory.

Data set description:

describe(mnl96.person)
## mnl96.person 
## 
##  6  Variables      189335  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  189335       0  189335 
## 
## lowest : 1      2      3      4      5     
## highest: 203517 203518 203519 203520 203521 
## ---------------------------------------------------------------------------
## sex 
##       n missing  unique 
##  189335       0       2 
## 
## Male (87768, 46%), Female (101567, 54%) 
## ---------------------------------------------------------------------------
## age 
##       n missing  unique 
##  189335       0      18 
## 
##           [0, 5[ [5, 10[ [10, 15[ [15, 20[ [20, 25[ [25, 30[ [30, 35[
## Frequency   2617   22125    21710    22320    20383    18998    17071
## %              1      12       11       12       11       10        9
##           [35, 40[ [40, 45[ [45, 50[ [50, 55[ [55, 60[ [60, 65[ [65, 70[
## Frequency    15662    13426    11169     8274     5951     4405     2943
## %                8        7        6        4        3        2        2
##           [70, 75[ [75, 80[ [80, 85[ [85, Inf[
## Frequency     1138      597      331       215
## %                1        0        0         0
## ---------------------------------------------------------------------------
## occup 
##       n missing  unique 
##  189335       0      14 
## 
##           Executive Professional Technician Clerical Service Farmer Trade
## Frequency     18292         5933       3661     5065   10885   1309  5608
## %                10            3          2        3       6      1     3
##           Operator Laborer Pupil Student Housewife Jobless Others
## Frequency     7034   13639 35029   25624     30402   19559   7295
## %                4       7    19      14        16      10      4
## ---------------------------------------------------------------------------
## occupc 
##       n missing  unique 
##  189335       0       6 
## 
## Blue-collar (18609, 10%) 
## White-collar (52817, 28%), Student (60653, 32%) 
## Housewife, househusband (30402, 16%) 
## Unemployed, retired (19559, 10%) 
## Other, unknown (7295, 4%) 
## ---------------------------------------------------------------------------
## 
## Variables with all observations missing:
## 
## [1] dist
## Warning: Removed 87768 rows containing non-finite values (stat_density).
## Warning: Removed 101567 rows containing non-finite values (stat_density).
## Warning: Removed 2617 rows containing non-finite values (stat_density).
## Warning: Removed 22125 rows containing non-finite values (stat_density).
## Warning: Removed 21710 rows containing non-finite values (stat_density).
## Warning: Removed 22320 rows containing non-finite values (stat_density).
## Warning: Removed 20383 rows containing non-finite values (stat_density).
## Warning: Removed 18998 rows containing non-finite values (stat_density).
## Warning: Removed 17071 rows containing non-finite values (stat_density).
## Warning: Removed 15662 rows containing non-finite values (stat_density).
## Warning: Removed 13426 rows containing non-finite values (stat_density).
## Warning: Removed 11169 rows containing non-finite values (stat_density).
## Warning: Removed 8274 rows containing non-finite values (stat_density).
## Warning: Removed 5951 rows containing non-finite values (stat_density).
## Warning: Removed 4405 rows containing non-finite values (stat_density).
## Warning: Removed 2943 rows containing non-finite values (stat_density).
## Warning: Removed 1138 rows containing non-finite values (stat_density).
## Warning: Removed 597 rows containing non-finite values (stat_density).
## Warning: Removed 331 rows containing non-finite values (stat_density).
## Warning: Removed 215 rows containing non-finite values (stat_density).
## Warning: Removed 18609 rows containing non-finite values (stat_density).
## Warning: Removed 52817 rows containing non-finite values (stat_density).
## Warning: Removed 60653 rows containing non-finite values (stat_density).
## Warning: Removed 30402 rows containing non-finite values (stat_density).
## Warning: Removed 19559 rows containing non-finite values (stat_density).
## Warning: Removed 7295 rows containing non-finite values (stat_density).

plot of chunk plotHistMnl96Person

1.1.5 Comparison Across Cities

1.1.5.1 Sex

plot of chunk plotHistPersonSexAll

1.1.5.2 Age

plot of chunk plotHistPersonAgeAll

1.1.5.3 Occupation

plot of chunk plotHistPersonOccupAll

1.1.5.4 Occupation Group

plot of chunk plotHistPersonOccupCAll

1.1.5.5 Distance Traveled

## Warning: Removed 87768 rows containing non-finite values (stat_density).
## Warning: Removed 101567 rows containing non-finite values (stat_density).

plot of chunk plotHistPersonDistAll

## Warning: Removed 2617 rows containing non-finite values (stat_density).
## Warning: Removed 22125 rows containing non-finite values (stat_density).
## Warning: Removed 21710 rows containing non-finite values (stat_density).
## Warning: Removed 22320 rows containing non-finite values (stat_density).
## Warning: Removed 20383 rows containing non-finite values (stat_density).
## Warning: Removed 18998 rows containing non-finite values (stat_density).
## Warning: Removed 17071 rows containing non-finite values (stat_density).
## Warning: Removed 15662 rows containing non-finite values (stat_density).
## Warning: Removed 13426 rows containing non-finite values (stat_density).
## Warning: Removed 11169 rows containing non-finite values (stat_density).
## Warning: Removed 8274 rows containing non-finite values (stat_density).
## Warning: Removed 5951 rows containing non-finite values (stat_density).
## Warning: Removed 4405 rows containing non-finite values (stat_density).
## Warning: Removed 2943 rows containing non-finite values (stat_density).
## Warning: Removed 1138 rows containing non-finite values (stat_density).
## Warning: Removed 597 rows containing non-finite values (stat_density).
## Warning: Removed 331 rows containing non-finite values (stat_density).
## Warning: Removed 215 rows containing non-finite values (stat_density).

plot of chunk plotHistPersonDistAll

## Warning: Removed 18609 rows containing non-finite values (stat_density).
## Warning: Removed 52817 rows containing non-finite values (stat_density).
## Warning: Removed 60653 rows containing non-finite values (stat_density).
## Warning: Removed 30402 rows containing non-finite values (stat_density).
## Warning: Removed 19559 rows containing non-finite values (stat_density).
## Warning: Removed 7295 rows containing non-finite values (stat_density).

plot of chunk plotHistPersonDistAll

1.2 Sample Trips

1.2.1 Dhaka 2009

The 42,111 people in the sample made a total of 105,121 non-stationary trips. The trip data occupies 5.9 Mb of memory.

Data set description:

describe(dhk09.trip)
## dhk09.trip 
## 
##  6  Variables      105121  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  105121       0   42098 
## 
## lowest : 1     2     3     4     5    
## highest: 47486 47489 47491 47492 47493 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  105121       0      18 
## 
##           1     2   3     4   5     6   7    8  9   10 11  12 13 14 15 16
## Frequency 2 42096 356 41689 184 10873 202 7850 63 1311 27 374  7 58  2 21
## %         0    40   0    40   0    10   0    7  0    1  0   0  0  0  0  0
##           18 20
## Frequency  4  2
## %          0  0
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  105121       0       7 
## 
##            Home  Work Education Shopping Social, recreation Errand
## Frequency 45257 23676     13455     6455               6775   1851
## %            43    23        13        6                  6      2
##           Other, unknown
## Frequency           7652
## %                      7
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##  105121       0  103368    4641   500.1   765.3  1511.2  3064.1  6039.5 
##     .90     .95 
## 10120.5 13780.7 
## 
## lowest :     4.232     5.451     7.485     7.485     8.853
## highest: 56457.580 58704.836 61913.059 62614.951 66284.393 
## ---------------------------------------------------------------------------
## pdate_s 
##                   n             missing              unique 
##              105121                   0                 424 
##                Mean                 .05                 .10 
## 2009-10-01 12:05:45 2009-10-01 06:00:00 2009-10-01 07:00:00 
##                 .25                 .50                 .75 
## 2009-10-01 08:10:00 2009-10-01 11:30:00 2009-10-01 16:00:00 
##                 .90                 .95 
## 2009-10-01 18:45:00 2009-10-01 20:00:00 
## 
## lowest : 2009-10-01 00:00:00 2009-10-01 00:21:00 2009-10-01 00:31:00 2009-10-01 00:32:00 2009-10-01 00:33:00
## highest: 2009-10-01 23:20:00 2009-10-01 23:25:00 2009-10-01 23:29:00 2009-10-01 23:35:00 2009-10-01 23:39:00 
## ---------------------------------------------------------------------------
## pdate_e 
##                   n             missing              unique 
##              105121                   0                1069 
##                Mean                 .05                 .10 
## 2009-10-01 12:43:30 2009-10-01 06:30:00 2009-10-01 07:30:00 
##                 .25                 .50                 .75 
## 2009-10-01 09:00:00 2009-10-01 12:06:00 2009-10-01 16:28:00 
##                 .90                 .95 
## 2009-10-01 19:27:00 2009-10-01 20:39:00 
## 
## lowest : 2009-10-01 01:00:00 2009-10-01 01:04:00 2009-10-01 01:05:00 2009-10-01 01:07:00 2009-10-01 01:09:00
## highest: 2009-10-01 23:47:00 2009-10-01 23:49:00 2009-10-01 23:50:00 2009-10-01 23:54:00 2009-10-02 06:00:00 
## ---------------------------------------------------------------------------

plot of chunk plotHistDhk09Trip

1.2.2 Hanoi 2004

The 34,846 people in the sample made a total of 122,530 non-stationary trips. The trip data occupies 6.5 Mb of memory.

Data set description:

describe(hni04.trip)
## hni04.trip 
## 
##  6  Variables      122530  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  122530       0   42908 
## 
## lowest : 2     4     6     7     8    
## highest: 75912 75913 75914 75915 75918 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  122530       0      28 
## 
## lowest : 1  2  3  4  5 , highest: 24 25 26 27 28 
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  122502      28       7 
## 
##            Home  Work Education Shopping Social, recreation Errand
## Frequency 56799 27966     13628     8065               4903  11011
## %            46    23        11        7                  4      9
##           Other, unknown
## Frequency            130
## %                      0
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##  122530       0   76956    3908   327.2   540.9  1160.5  2609.3  5098.8 
##     .90     .95 
##  8570.0 11436.6 
## 
## lowest :     9.199     9.202     9.207     9.220     9.221
## highest: 49427.900 51578.803 59683.450 82004.481 97926.501 
## ---------------------------------------------------------------------------
## pdate_s 
##                   n             missing              unique 
##              122530                   0                 434 
##                Mean                 .05                 .10 
## 2004-10-01 11:53:06 2004-10-01 06:15:00 2004-10-01 06:45:00 
##                 .25                 .50                 .75 
## 2004-10-01 07:30:00 2004-10-01 11:20:00 2004-10-01 16:30:00 
##                 .90                 .95 
## 2004-10-01 17:55:00 2004-10-01 19:30:00 
## 
## lowest : 2004-10-01 00:00:00 2004-10-01 01:00:00 2004-10-01 01:15:00 2004-10-01 01:30:00 2004-10-01 02:00:00
## highest: 2004-10-01 23:20:00 2004-10-01 23:25:00 2004-10-01 23:30:00 2004-10-01 23:40:00 2004-10-01 23:45:00 
## ---------------------------------------------------------------------------
## pdate_e 
##                   n             missing              unique 
##              122530                   0                 788 
##                Mean                 .05                 .10 
## 2004-10-01 12:11:44 2004-10-01 06:40:00 2004-10-01 07:00:00 
##                 .25                 .50                 .75 
## 2004-10-01 07:45:00 2004-10-01 11:40:00 2004-10-01 16:50:00 
##                 .90                 .95 
## 2004-10-01 18:10:00 2004-10-01 19:45:00 
## 
## lowest : 2004-10-01 00:05:00 2004-10-01 00:15:00 2004-10-01 01:10:00 2004-10-01 01:15:00 2004-10-01 01:20:00
## highest: 2004-10-01 23:40:00 2004-10-01 23:45:00 2004-10-01 23:50:00 2004-10-01 23:55:00 2004-10-01 23:59:00 
## ---------------------------------------------------------------------------

plot of chunk plotHistHni04Trip

1.2.3 Jakarta 2002

The 36,713 people in the sample made a total of 746,149 non-stationary trips. The trip data occupies 41.5 Mb of memory.

Data set description:

describe(jkt02.trip)
## jkt02.trip 
## 
##  6  Variables      746149  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  746149       0  297042 
## 
## lowest : 1      2      3      4      6     
## highest: 416994 417004 417009 417027 417029 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  746149       0      26 
## 
## lowest : 1  2  3  4  5 , highest: 24 26 28 30 32 
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  746149       0       6 
## 
##             Home   Work Education Shopping Errand Other, unknown
## Frequency 334970 152190    107652    53452  77765          20120
## %             45     20        14        7     10              3
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##  746149       0  714595   14836   781.8  1328.5  3619.3 11679.2 22135.0 
##     .90     .95 
## 32850.5 41483.7 
## 
## lowest : 1.827e+00 2.715e+00 2.891e+00 2.895e+00 2.901e+00
## highest: 1.123e+05 1.134e+05 1.146e+05 1.152e+05 1.240e+05 
## ---------------------------------------------------------------------------
## pdate_s 
##                   n             missing              unique 
##              746149                   0                 984 
##                Mean                 .05                 .10 
## 2002-10-01 11:42:47 2002-10-01 06:00:00 2002-10-01 06:30:00 
##                 .25                 .50                 .75 
## 2002-10-01 07:15:00 2002-10-01 11:50:00 2002-10-01 16:00:00 
##                 .90                 .95 
## 2002-10-01 17:50:00 2002-10-01 19:05:00 
## 
## lowest : 2002-10-01 00:00:00 2002-10-01 00:08:00 2002-10-01 00:09:00 2002-10-01 00:30:00 2002-10-01 01:00:00
## highest: 2002-10-01 23:40:00 2002-10-01 23:45:00 2002-10-01 23:50:00 2002-10-01 23:55:00 2002-10-01 23:57:00 
## ---------------------------------------------------------------------------
## pdate_e 
##                   n             missing              unique 
##              746149                   0                1206 
##                Mean                 .05                 .10 
## 2002-10-01 12:13:39 2002-10-01 06:40:00 2002-10-01 06:50:00 
##                 .25                 .50                 .75 
## 2002-10-01 08:00:00 2002-10-01 12:05:00 2002-10-01 16:15:00 
##                 .90                 .95 
## 2002-10-01 18:30:00 2002-10-01 19:58:00 
## 
## lowest : 2002-10-01 00:02:00 2002-10-01 00:05:00 2002-10-01 00:10:00 2002-10-01 00:15:00 2002-10-01 00:20:00
## highest: 2002-10-01 23:50:00 2002-10-01 23:54:00 2002-10-01 23:55:00 2002-10-01 23:58:00 2002-10-01 23:59:00 
## ---------------------------------------------------------------------------

plot of chunk plotHistJkt02Trip

1.2.4 Metro Manila 1996

## [1] TRUE
## [1] TRUE
## [1] TRUE
## [1] TRUE

The 189,335 people in the sample made a total of 324,312 non-stationary trips. The trip data occupies 13.9 Mb of memory.

Data set description:

describe(mnl96.trip)
## mnl96.trip 
## 
##  4  Variables      324312  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  324312       0  144786 
## 
## lowest : 1      3      4      5      6     
## highest: 203517 203518 203519 203520 203521 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  324312       0      38 
## 
## lowest : 1  2  3  4  5 , highest: 34 35 36 37 39 
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  324312       0       7 
## 
##             Home  Work Education Shopping Social, recreation Errand
## Frequency 152507 69545     57706    21903              12531   7397
## %             47    21        18        7                  4      2
##           Other, unknown
## Frequency           2723
## %                      1
## ---------------------------------------------------------------------------
## 
## Variables with all observations missing:
## 
## [1] dist

plot of chunk plotHistMnl96Trip

1.2.5 Comparison Across Cities

plot of chunk plotHistTripActivityAll

1.3 Sample Subtrips

1.3.1 Dhaka 2009

The 42,111 people in the sample made a total of 314,029 subtrips. 46.7% of these are actually stationarity events, which reduces the true number of subtrips to 167,354. The subtrip data occupies 5.9 Mb of memory.

Data set description:

describe(dhk09.subtrip)
## dhk09.subtrip 
## 
##  9  Variables      167354  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  167354       0   42098 
## 
## lowest : 1     2     3     4     5    
## highest: 47486 47489 47491 47492 47493 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  167354       0      18 
## 
##           1     2   3     4   5     6   7     8   9   10 11  12 13 14 15
## Frequency 6 68427 586 64701 254 18095 274 12311 101 1913 42 534  8 67  2
## %         0    41   0    39   0    11   0     7   0    1  0   0  0  0  0
##           16 18 20
## Frequency 26  5  2
## %          0  0  0
## ---------------------------------------------------------------------------
## sno 
##       n missing  unique 
##  167354       0       5 
## 
##                1     2     3    4   5
## Frequency 105121 44055 16012 2009 157
## %             63    26    10    1   0
## ---------------------------------------------------------------------------
## tmode 
##       n missing  unique 
##  167354       0      18 
## 
## Walking (65527, 39%), Bicycle (410, 0%) 
## Motorcycle (1529, 1%), CNG, mishuk (5684, 3%) 
## Auto temp (2163, 1%), Car (5348, 3%) 
## Microbus, jeep (465, 0%) 
## Minibus, bus (private) (10785, 6%) 
## Staff bus (908, 1%), School, college bus (686, 0%) 
## Truck (57, 0%), Auto (private) (52, 0%) 
## School van (1345, 1%), AC bus (56, 0%) 
## Rickshaw (54514, 33%), Taxi (273, 0%) 
## Bus (BRTC) (587, 0%), Bus (not BRTC) (16965, 10%) 
## ---------------------------------------------------------------------------
## tmodec 
##       n missing  unique 
##  167354       0       7 
## 
##           Walking Bicycle Motorcycle  Taxi  Car Truck   Bus
## Frequency   65527     410       1529 60471 7563    57 31797
## %              39       0          1    36    5     0    19
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  167354       0       7 
## 
##            Home  Work Education Shopping Social, recreation Errand
## Frequency 70455 40388     19804    10308              12183   3227
## %            42    24        12        6                  7      2
##           Other, unknown
## Frequency          10989
## %                      7
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##  167354       0  154776    2915   426.9   636.7  1153.6  2060.8  3550.9 
##     .90     .95 
##  5873.1  8358.5 
## 
## lowest :     3.640     4.232     5.451     7.485     7.485
## highest: 50500.481 50500.493 52381.155 58704.836 66284.393 
## ---------------------------------------------------------------------------
## pdate_s 
##                   n             missing              unique 
##              167354                   0                1295 
##                Mean                 .05                 .10 
## 2009-10-01 12:16:52 2009-10-01 06:00:00 2009-10-01 07:00:00 
##                 .25                 .50                 .75 
## 2009-10-01 08:30:00 2009-10-01 11:44:00 2009-10-01 16:00:00 
##                 .90                 .95 
## 2009-10-01 19:00:00 2009-10-01 20:00:00 
## 
## lowest : 2009-10-01 00:00:00 2009-10-01 00:20:00 2009-10-01 00:21:00 2009-10-01 00:31:00 2009-10-01 00:32:00
## highest: 2009-10-01 23:30:00 2009-10-01 23:35:00 2009-10-01 23:37:00 2009-10-01 23:39:00 2009-10-02 01:30:00 
## ---------------------------------------------------------------------------
## pdate_e 
##                   n             missing              unique 
##              167354                   0                1326 
##                Mean                 .05                 .10 
## 2009-10-01 12:40:36 2009-10-01 06:18:00 2009-10-01 07:29:00 
##                 .25                 .50                 .75 
## 2009-10-01 08:52:00 2009-10-01 12:06:00 2009-10-01 16:29:00 
##                 .90                 .95 
## 2009-10-01 19:20:00 2009-10-01 20:30:00 
## 
## lowest : 2009-10-01 00:20:00 2009-10-01 00:40:00 2009-10-01 00:59:00 2009-10-01 01:00:00 2009-10-01 01:03:00
## highest: 2009-10-01 23:49:00 2009-10-01 23:50:00 2009-10-01 23:54:00 2009-10-02 01:30:00 2009-10-02 06:00:00 
## ---------------------------------------------------------------------------

plot of chunk plotHistDhk09Subtrip

1.3.2 Hanoi 2004

The 34,846 people in the sample made a total of 302,813 subtrips. 59.0% of these are actually stationarity events, which reduces the true number of subtrips to 124,062. The subtrip data occupies 6.5 Mb of memory.

Data set description:

describe(hni04.subtrip)
## hni04.subtrip 
## 
##  9  Variables      124062  Observations
## ---------------------------------------------------------------------------
## pid 
##       n missing  unique 
##  124062       0   42908 
## 
## lowest : 2     4     6     7     8    
## highest: 75912 75913 75914 75915 75918 
## ---------------------------------------------------------------------------
## tno 
##       n missing  unique 
##  124062       0      28 
## 
## lowest : 1  2  3  4  5 , highest: 24 25 26 27 28 
## ---------------------------------------------------------------------------
## sno 
##       n missing  unique 
##  124062       0       4 
## 
## 1 (122530, 99%), 2 (1264, 1%), 3 (134, 0%), 4 (134, 0%) 
## ---------------------------------------------------------------------------
## tmode 
##       n missing  unique 
##  124062       0      15 
## 
## Walking (35301, 28%), Bicycle (26091, 21%) 
## Motorcycle (driver) (49217, 40%) 
## Motorcycle (passenger) (7182, 6%) 
## Car (driver) (599, 0%) 
## Car (passenger, not taxi) (578, 0%) 
## Minibus (<=25 pax) (326, 0%) 
## Standard bus (>25 pax) (2983, 2%) 
## Cyclo (47, 0%), Xe om (736, 1%), Taxi (187, 0%) 
## Tourist bus (68, 0%), Company bus (331, 0%) 
## School bus (114, 0%), Truck (302, 0%) 
## ---------------------------------------------------------------------------
## tmodec 
##       n missing  unique 
##  124062       0       7 
## 
##           Walking Bicycle Motorcycle Taxi  Car Truck  Bus
## Frequency   35301   26091      56399  970 1177   302 3822
## %              28      21         45    1    1     0    3
## ---------------------------------------------------------------------------
## activity 
##       n missing  unique 
##  124034      28       7 
## 
##            Home  Work Education Shopping Social, recreation Errand
## Frequency 57542 28254     13994     8093               4920  11097
## %            46    23        11        7                  4      9
##           Other, unknown
## Frequency            134
## %                      0
## ---------------------------------------------------------------------------
## dist 
##       n missing  unique    Mean     .05     .10     .25     .50     .75 
##  124062       0   78095    3860   326.6   537.6  1151.4  2588.8  5060.6 
##     .90     .95 
##  8466.1 11274.8 
## 
## lowest :     6.231     9.199     9.202     9.207     9.220
## highest: 46313.503 47065.393 47140.065 49427.900 51578.803 
## ---------------------------------------------------------------------------
## pdate_s 
##                   n             missing              unique 
##              124062                   0                 624 
##                Mean                 .05                 .10 
## 2004-10-01 11:53:13 2004-10-01 06:15:00 2004-10-01 06:40:00 
##                 .25                 .50                 .75 
## 2004-10-01 07:30:00 2004-10-01 11:25:00 2004-10-01 16:30:00 
##                 .90                 .95 
## 2004-10-01 17:55:00 2004-10-01 19:30:00 
## 
## lowest : 2004-10-01 00:00:00 2004-10-01 01:00:00 2004-10-01 01:15:00 2004-10-01 01:30:00 2004-10-01 02:00:00
## highest: 2004-10-01 23:20:00 2004-10-01 23:25:00 2004-10-01 23:30:00 2004-10-01 23:40:00 2004-10-01 23:45:00 
## ---------------------------------------------------------------------------
## pdate_e 
##                   n             missing              unique 
##              124062                   0                 873 
##                Mean                 .05                 .10 
## 2004-10-01 12:11:37 2004-10-01 06:35:00 2004-10-01 07:00:00 
##                 .25                 .50                 .75 
## 2004-10-01 07:45:00 2004-10-01 11:40:00 2004-10-01 16:50:00 
##                 .90                 .95 
## 2004-10-01 18:10:00 2004-10-01 19:45:00 
## 
## lowest : 2004-10-01 00:05:00 2004-10-01 00:15:00 2004-10-01 01:10:00 2004-10-01 01:15:00 2004-10-01 01:20:00
## highest: 2004-10-01 23:40:00 2004-10-01 23:45:00 2004-10-01 23:50:00 2004-10-01 23:55:00 2004-10-01 23:59:00 
## ---------------------------------------------------------------------------

plot of chunk plotHistHni04Subtrip

1.3.3 Jakarta 2002

The 36,713 people in the sample made a total of 2,101,280 subtrips. 49.6% of these are actually stationarity events, which reduces the true number of subtrips to 1,058,822.

Data set description:

format(object.size(jkt02.subtrip), units = "auto")
## [1] "68.4 Mb"
names(jkt02.subtrip)
## [1] "pid"      "tno"      "sno"      "tmode"    "tmodec"   "activity"
## [7] "dist"     "pdate_s"  "pdate_e"
str(jkt02.subtrip)
## 'data.frame':    1058822 obs. of  9 variables:
##  $ pid     : Factor w/ 297043 levels "1","2","3","4",..: 1 1 1 1 1 1 1 2 2 3 ...
##  $ tno     : Factor w/ 33 levels "1","2","3","4",..: 2 4 6 8 10 12 14 2 4 2 ...
##  $ sno     : Factor w/ 6 levels "1","2","3","4",..: 1 1 1 1 1 1 1 1 1 1 ...
##  $ tmode   : Factor w/ 23 levels "Walking to final destination",..: 5 5 5 5 5 5 5 1 1 1 ...
##  $ tmodec  : Factor w/ 9 levels "Walking","Bicycle",..: 5 5 5 5 5 5 5 1 1 1 ...
##  $ activity: Factor w/ 7 levels "Home","Work",..: 2 4 2 4 2 4 1 7 4 7 ...
##  $ dist    : num  3929 2428 2088 5709 9023 ...
##  $ pdate_s : POSIXct, format: "2002-10-01 07:30:00" "2002-10-01 09:00:00" ...
##  $ pdate_e : POSIXct, format: "2002-10-01 08:05:00" "2002-10-01 09:10:00" ...
levels(jkt02.subtrip$tmodec)
## [1] "Walking"    "Bicycle"    "Motorcycle" "Taxi"       "Car"       
## [6] "Truck"      "Bus"        "Railway"    "Boat"
table(jkt02.subtrip$tmodec)
## 
##    Walking    Bicycle Motorcycle       Taxi        Car      Truck 
##     478734      17169     113412      63061      54107        636 
##        Bus    Railway       Boat 
##     331703          0          0
levels(jkt02.subtrip$activity)
## [1] "Home"               "Work"               "Education"         
## [4] "Shopping"           "Social, recreation" "Errand"            
## [7] "Other, unknown"
table(jkt02.subtrip$activity)
## 
##               Home               Work          Education 
##             480509             229632             157506 
##           Shopping Social, recreation             Errand 
##              70339                  0              97241 
##     Other, unknown 
##              23595

plot of chunk plotHistJkt02Subtrip

1.3.4 Metro Manila 1996

## [1] TRUE
## [1] TRUE
## [1] TRUE
## [1] TRUE

The 189,335 people in the sample made a total of 1,077,258 subtrips. 47.5% of these are actually stationarity events, which reduces the true number of subtrips to 565,041.

Data set description:

format(object.size(mnl96.subtrip), units = "auto")
## [1] "27.4 Mb"
names(mnl96.subtrip)
## [1] "pid"      "tno"      "sno"      "tmode"    "tmodec"   "activity"
## [7] "dist"
str(mnl96.subtrip)
## 'data.frame':    565041 obs. of  7 variables:
##  $ pid     : Factor w/ 189335 levels "1","2","3","4",..: 1 1 3 3 3 3 3 3 4 4 ...
##  $ tno     : Factor w/ 40 levels "1","2","3","4",..: 2 4 2 2 2 4 4 4 2 4 ...
##  $ sno     : Factor w/ 9 levels "1","2","3","4",..: 1 1 1 2 3 1 2 3 1 1 ...
##  $ tmode   : Factor w/ 21 levels "Walking","Pedicab",..: 1 1 1 6 1 1 6 1 2 1 ...
##  $ tmodec  : Factor w/ 9 levels "Walking","Bicycle",..: 1 1 1 7 1 1 7 1 4 1 ...
##  $ activity: Factor w/ 7 levels "Home","Work",..: NA NA NA NA NA NA NA NA NA NA ...
##  $ dist    : num  NA NA NA NA NA NA NA NA NA NA ...
levels(mnl96.subtrip$tmodec)
## [1] "Walking"    "Bicycle"    "Motorcycle" "Taxi"       "Car"       
## [6] "Truck"      "Bus"        "Railway"    "Boat"
table(mnl96.subtrip$tmodec)
## 
##    Walking    Bicycle Motorcycle       Taxi        Car      Truck 
##     232123       2292      89554      19110      26347       1308 
##        Bus    Railway       Boat 
##     193627          0        680
levels(mnl96.subtrip$activity)
## [1] "Home"               "Work"               "Education"         
## [4] "Shopping"           "Social, recreation" "Errand"            
## [7] "Other, unknown"
table(mnl96.subtrip$activity)
## 
##               Home               Work          Education 
##                  0                  0                  0 
##           Shopping Social, recreation             Errand 
##                  0                  0                  0 
##     Other, unknown 
##                  0

plot of chunk plotHistMnl96Subtrip

1.3.5 Comparison Across Cities

plot of chunk plotHistSubtripTModeCAll

1.4 Commuting Behavior

This section covers the analysis of the commuting behavior in all four cities both in the morning (0:00 - 11:59) and in the evening (12:00 - 23:59) regarding distance, duration, and multi-modality.

1.4.1 Dhaka 2009

Data set description:

dhk09CommBehavSexNoon <- ddply(dhk09.commbehav, c("sex", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
dhk09CommBehavAgeNoon <- ddply(dhk09.commbehav, c("age", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
dhk09CommBehavOccupCNoon <- ddply(dhk09.commbehav, c("occupc", "noon"), summarize, 
                                  mean_dist = mean(dist), 
                                  mean_duration = mean(duration), 
                                  subtrips = mean(subtrips))
print(xtable(dhk09CommBehavSexNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:44:30 2014
print(xtable(dhk09CommBehavAgeNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:44:30 2014
print(xtable(dhk09CommBehavOccupCNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:44:30 2014

plot of chunk plotCommBehavDhk09

1.4.2 Hanoi 2004

Data set description:

hni04CommBehavSexNoon <- ddply(hni04.commbehav, c("sex", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
hni04CommBehavAgeNoon <- ddply(hni04.commbehav, c("age", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
hni04CommBehavOccupCNoon <- ddply(hni04.commbehav, c("occupc", "noon"), summarize, 
                                  mean_dist = mean(dist), 
                                  mean_duration = mean(duration), 
                                  subtrips = mean(subtrips))
print(xtable(hni04CommBehavSexNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:45:08 2014
print(xtable(hni04CommBehavAgeNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:45:08 2014
print(xtable(hni04CommBehavOccupCNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:45:08 2014

plot of chunk plotCommBehavHni04

1.4.3 Jakarta 2002

Data set description:

jkt02CommBehavSexNoon <- ddply(jkt02.commbehav, c("sex", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
jkt02CommBehavAgeNoon <- ddply(jkt02.commbehav, c("age", "noon"), summarize, 
                               mean_dist = mean(dist), 
                               mean_duration = mean(duration), 
                               subtrips = mean(subtrips))
jkt02CommBehavOccupCNoon <- ddply(jkt02.commbehav, c("occupc", "noon"), summarize, 
                                  mean_dist = mean(dist), 
                                  mean_duration = mean(duration), 
                                  subtrips = mean(subtrips))
print(xtable(jkt02CommBehavSexNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:53:29 2014
print(xtable(jkt02CommBehavAgeNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:53:29 2014
print(xtable(jkt02CommBehavOccupCNoon, type = "html"))
## Warning: class of 'x' was discarded
% latex table generated in R 3.1.0 by xtable 1.7-3 package % Fri Jul 18 18:53:29 2014

plot of chunk plotCommBehavJkt02

1.4.4 Metro Manila 1996

1.4.5 Comparison Across Cities

plot of chunk plotCommBehavAll plot of chunk plotCommBehavAll plot of chunk plotCommBehavAll plot of chunk plotCommBehavAll

1.5 Spatio-Temporal Distribution of People

This section covers the analysis of the distribution of people in all four cities over space and time. For analysis purposes we generated concentric ring buffers in 5km distances from the centers of all four cities (as per the OpenStreetMap data).

1.5.1 Dhaka 2009

plot of chunk plotSTDistributionDhk09

1.5.2 Hanoi 2004

plot of chunk plotSTDistributionHni04

1.5.3 Jakarta 2002

plot of chunk plotSTDistributionJkt02

1.5.4 Metro Manila 1996

1.5.5 Comparison Across Cities

plot of chunk plotSTDistAll

1.6 Spatio-Temporal Distribution of Trip Origins

This section covers the analysis of the distribution of the origins of trips in all four cities over space and time. For analysis purposes we generated concentric ring buffers in 5km distances from the centers of all four cities (as per the OpenStreetMap data).

1.7 Final Steps

It’s always a good idea to clean up after you’re done…

## [1] TRUE
## [1] TRUE